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ABSTRACT We have developed high-density DNA mi- 
croarrays of yeast ORFs. These microarrays can monitor 
hybridization to ORFs for applications such as quantitative 
differential gene expression analysis and screening for se- 
quence polymorphisms. Automated scripts retrieved sequence 
information from public databases to locate predicted ORFs 
and select appropriate primers for amplification. The primers 
were used to amplify yeast ORFs in 96-weIl plates, and the 
resulting products were arrayed using an automated micro 
arraying device. Arrays containing up to 2,479 yeast ORFs 
were printed on a single slide. The hybridization of fluores- 
cently labeled samples to the array were detected and quan- 
titated with a laser confocal scanning microscope. Applica- 
tions f the microarrays are shown for genetic and gene 
expression analysis at the whole genome level 

The genome sequencing projects have generated and will con- 
tinue to generate enormous amounts of sequence data. The 
genomes of Saccharomyces cerevisiae, Haemophilus influenzae (1), 
Mycoplasma genuaUum (2), and Meihanococcus jannisehu (3) 
have been completely sequenced. Other model organisms have 
had substantia] portions of their genomes sequenced as well 
including the nematode Caenomabditis elegans (4) and the small 
flowering plant Arabidopsis thabana (5). Given this ever- 
increasing amount of sequence information, new strategies are 
necessary to efficiently pursue the next phase of the genome 
projects—the elucidation of gene expression patterns and gene 
product function on a whole genome scale. 

One important use of genome sequence data is to attempt 
t identify the functions of predicted ORFs within the genome. 
Many f the ORFs identified in the yeast genome sequence 
were not identified in decades of genetic studies and have no 
significant homology to previously identified sequences in the 
database. In addition, even m cases where ORFs have signif- 
icant h mology to sequences in the database, or have known 
sequence motifs (e.g., protein kinase), this is not sufficient to 
determine the actual biological role of the gene product. 
Experimental analysis must be performed to thoroughly un- 
derstand the biological function of a given ORFs product. 
Model organisms, such as 5. cerevisiae, will be extremely 
important in improving our understanding of other more 
c mplex and less manipulate organisms. 

To examine in detail the functional role of individual ORFs and 
relationships between genes at the expression level, this work 
describes the use of genome sequence information to study large 
numbers of genes efficiently and systematically. The procedure 
was as f Uows. (0 Software scripts scanned annotated sequence 
information from public databases for predicted ORFs. (u) The 
stan and stop position f each identified ORF was extracted 
aut matically, along with the sequence data f the ORF and 200 
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bases flanking either side, (iu) These data were used to automat- 
ically select PCR primers that would amplify the ORF. (rV) The 
primer sequences were automatically input into the automated 
multiplex oligonucleotide synthesizer (6). (v) The oligonucleo- 
tides were synthesized in 96-weD format, and (vi) used in 96-well 
format to amplify the desired ORFs from a genomic DNA 
template, (vil) The products were arrayed using a high-density 
DNA arrayer (7-10). The gene arrays can be used for hybridiza- 
tion with a variety of labeled products such as cDNA for gene 
expression analysis or genomic DNA for strain comparisons, and 
genomic mismatch scanning purified DNA for genotyping (11). 

METHODS 

Script Design, All scripts were written in UNDCTool Command 
Language. Annotated sequence information from GenBank was 
extracted into one file containing the complete nucleotide se- 
quence of a single chromosome. A second file contained the 
assigned ORF name followed by the start and stop positions of that 
ORF. The actual sequence contained within the specified range, 
along with 200 bases of sequence flanking both sides, was extracted 
and input into the primer selection program primer as (White- 
head Institute, Boston). Primers were designed so as to allow 
amplification of entire ORFs. The selected primer sequences were 
read by the 96-wcll automated multiplex oligonucleotide synthe- 
sizer instrument for primer synthesis. The forward and reverse 
primers were synthesized in two separate 96-well plates in corre- 
sponding wells. All primers were synthesized on a 20-nmol scale. 

ORF Amplification and Purification. Genomic DNA was iso- 
lated as described (12) and used as template for the amplification 
reactions. Each PCR was done in a total volume of 100 yl. A total 
of 02 uM each of forward and reverse primers were aliquoted into 
a 96-well PCR plate (Robbins Scientific; Sunnyvale, CA); a master 
mix containing 0.24 mM each dNTP, 10 mM Tris (pH 8-5), 50 mM 
MgCl 2 , 25 units Too polymerase, and 10 ng of template was added 
to the primers, and the entire mix was thermal cycled for 30 cycles 
as follows: 15 min at 94°C, 15 min at 54°C, and 30 min at 72*C 
Products were ethanol precipitated in polystyrene v-bottom 96- 
well plates (Costar). All samples were dried and stored at -20°C 

Arraying Procedure and Processing. Microarrays were 
made as described (8). 

A custom built arraying robot was used to print batches of 48 
slides. The robot utilizes four printing tips which simultaneously 
pick up — 1 /J of solution from 96-well microliter plates. After 
printing, the microarrays were rehyd rated for 30 sec in a humid 
chamber and then snap dried for 2 sec on a hot plate (100*Q. The 
DNA was then UV crosslinked to the surface by subjecting the 
slides to 60 millijoules of energy. The rest of the pory-L-lysine 
surface was blocked by a 15 -min incubation in a solution f 70 mM 
succinic anhydride dissolved in a solution consisting of 315 ml of 
l-methyl-2-pyrrolidinone (Aldrich) and 35 ml f 1 M boric acid 
(pH 8.0). Directly after the blocking reacti n, the bound DNA 
was denatured by a 2-min incubati n in distilled water at — 95°C 

Abbreviation: YEP, yeast extract /peptone. 
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Fi ?'l- Twocolor fluorescent scan of a yeast microarray contain- 
ing 2,479 elements (ORFs). The center-to-center distance between 
elements ts 345 tun. A probe mixture consisting of cDNA from yeast 
extract/peptone (YEP) galactose (green pseudocolor) and YEP glu- 
cose (red pseudocolor) grown yeast cultures was hybridized to the 
array. Intensity per element corresponds to ORF expression and 
pseudocolor per element corresponds to relative ORF exDression 
between the two cultures. 

The slides were then transferred into a bath of 100% ethanol at 
room temperature. 

Probe Preparation: cDNA. Yeast cultures (100 ml) were grown 
to ~1 ODamo and total RNA was isolated as described (13) Up 
to 500 jig total RNA was used to isolate mRNA (Qiaeen 
Chatsw rth, CA). OLgo(aT)20 (5 /ig) was added and annealed to 
2 Mg f mRNA by heating the reaction to 70°C for 10 min and 
quick chilling on ice, phis 2 Superscript D (200 units/ul) (Life 
Technologies, Gaithersburg, MD), 0.6 50x dNTP mix (final 
concentrations were 500 /tM dATP, dCTP, dGTP, and 200 uM 
aTTP), 6 /J 5x reaction buffer, and 60 tiM Cy3-dUTP or 
Qtf-dUTP (Amersham). Reactions were carried out at 42°C for 
2 h, after which the mRNA was degraded by the addition of 0 3 
/d 5 M NaOH and 03 >d 100 mM EDTA and heating to 65°C for 
10 nun. The sample was then diluted to 500 with TE and 
concentrated using a Microcon-30 (Amicon) to 10 /J. 

Probe Preparation: Genomic DNA. Fluorescent DNA was 
prepared fr m total genomic DNA as follows: 1 M g of random 
nonamer ligonucleotides was added to 2.5 jig of genomic 
DNA. This mixture was boiled for 2 min and then chilled on 
ice. A reaction mixture containing dNTPs (25 uM dATP 
dCTP, dGTP, 10 mM oTTP, and 40 M M Cy3-dUTP or" 
Cy5-dUTP) reaction buffer (New England Biolabs), and 20 
units exonudease free KJenow enzyme (United States Bio- 
chemical) was added, and the reaction was incubated at 37°C 
for 2 h. The sample was then diluted to 500 fi\ with TE and 
concentrated using a Microcon-30 (Amicon) to 10 *d. 

Hybridkanon. Purified, labeled probe was resuspended in 1 1 
crJ containing 10 ^Escherichia coii tRNA, and 03% 

SDS. The sample was then heated for 2 min in boiling water 
cooled rapidly to room temperature, and applied t the array The 
array was placed in a sealed, humidified, hybridization chamber. 
Hybridization was carried out for 10 h in a 62°C water bath, after 
which the arrays were washed immediately in 2x SSC/0 2% SDS 
A second wash was perf rmed in 0.1 x SSC 

Analysis and Quantitation. Arrays were scanned on a 
scanning laser fluorescence microsc pe developed by Steve 
Smith with software written by Noam Ziv (Stanford Univer- 
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sjy). A separate scan was done for each of the two flu re 
phores used. The images were then combined for analysis. A 
bounding box, fined to the size f the DNA spots, was placed 
over each array element The average fluorescent intensity was 
calculated by summing the intensities of each pixel present in 
a bounding box and then dividing by the total number of pixels, 
Local area background was calculated for each array element 
by determining the average fluorescent intensity at the edge of 
the bounding box. To normalize for f luorophore-spedficVari. 
ation control spots containing yeast genomic DNA were 
applied to each quadrant during the arraying process. Tnese 
elements were quanritated and the ratios of the signals were 
determined These ratios were then used to normalize the 
photomultipiier sensitivity settings such that the ratios of the 

tTFS? ° f lhc genomic DNA s P° te wcrc dose to a value 
of 1.0. pe average signal intensity at any given spot was 
regarded as significant if it was at least two standard deviations 
above background. Each experiment was conducted in dupli- 
cate with the fluorophores representing each channel re- 
versed. The ratios presented here are the average of the two 
experiments, except in the case in which the signal for the 
element m question was below the reliability threshold. The 
reliability threshold also determined the dynamic range of the 
experiment. For all of the experiments presented, the average 
dynamic range was -1 to 100. In the case where the fluores- 
cence from a very bright spot saturates the detector, differ- 
ential ratios will, in general, be underestimated. This can be 
compensated for by scanning at a lower overall sensitivity. 

RESULTS 

The accumulation of sequence information from model organ- 
isms presents an enormous opportunity and challenge to under- 
stand the biological function of many previously uncharacterized 
genes. To do this accurately and efficiently, a directed strategy 
was developed that enables the monitoring of multiple genes 
S ? U ! l ^f T OUS,y - Microarra y«g technology provides a method by 
which DNA can be attached to a glass surface in a high-density 
format (8). In practice, it is possible to array over 6,000 elements 

m / n /™ ™i han 18 cm2 ' Givcn *« Ac yeast genome consists 
of -6 100 ORFs, the entire set of yeast genes can be spotted onto 
a single glass slide. 

With this capability and the availability of the entire se- 
quence of the yeast genome, our strategy was to use a directed 
approach for generating the complete genome array. This 
procedure involved synthesizing a pair of oligonucleotide 
primers to amplify each ORF. The PCR product containing 
each gene of interest was arrayed onto glass and used for 
example, as probe for monitoring gene expression levels by 
hybridizing to the array labeled cDNA generated from isolated 
mRNA of a culture grown under any experimental condition. 

Primer Selection and Synthesis. The primer selection was fully 

n < ^1 1 001 Cofnmand Language scripts and primer 

0.5. (Whitehead). Pnmer pairs were automatically selected suc- 
cessfully for >99% of the ORFs tested. Primer sequences can thus 
be selected rapidly with minimal manual processing. A complete 
set of forward and reverse primers were selected initially for each 
ORF on chromosomes I, II, III, V, VI, VIII, IX, X, and XI 
Primers for a representative set of ORFs (15% coverage) were 
chosen for the remaining chromosomes. With the release of the 
entire yeast genome sequence, the complete set f primers has 
now been selected. 

Because each ORF requires a unique pair f synthetic primers, 
a total of approximately 12,200 oligonucleotides will be required 
to mdrvidually amplify each target This costly c mponent was 
addressed with the automated multiplex olig nucleotide synthe- 
sizer (6) which efficiently synthesizes primers in a 96-weU f final. 
Each pnmer, synthesized on a 20-nmo! scale, provides enough 
material for 100 amplification reactions, whereas a given PCR 
product provides enough material t generate an element on 
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Table 1. Heat shock v* control expression data 
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YBR009 
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14 


YDR343 


HXT6 




11 


YHR092 


HXT4 


3.6 




YAR071 


rHUli 




13 


YLR096 


KW2 


23 




YER102 


RFS8B 


2.6 




YBR181 


RPSlOl 


16 




YCR031 


CRY1 


2.7 




YLR441 


RPIOA 


18 




YHR141 


RPL41B 


18 




YBL072 


RPS8A 


18 




YHL015 


URP2 


18 




YBR191 


URP1 


3.1 




YLR340 


RPLAO 


33 


5.8 


YGL123 
YLR194 


SUP44 



Description 

Proline oxidase 
Acetylornithine aminotransferase 
Chorismate synthase 
Heat shock protein 
Heat shock protein 

Similarity l0 HSP30 heat shock protein Yrolp 
Heat shock protein 
Heat shock protein 

Mitochondrial heat shock protein HSP60 
Heat shock protein of the HSP70 family 
Cytoplasmic heat shock protein 
Heat shock protein 

Mitochondrial heat shock protein of dpb family of ATP.depcndent proteases 

Heat shock protein 

70-kDa heat shock protein 

Heat shock protein 

Histone transcription regulator 

Histone H2BJ 

Histonc H2A^ 

Histone H3 

Histone H4 

High-affinity hexose transporter 

Moderate- to low-affinity glucose transporter 

Secreted acid phosphatase, 56 kDa isozyme 

Ser/Thr protein kinase 

Ribosomal protein S8.e 

Ribosomal protein S6.e 

40S ribosoroaJ protein Sl4.e 

Ribosomal protein S3.a* 
Ribosomal protein L36a,e 
Ribosomal protein S8.e 
Ribosomal protein 
Ribosomal protein L21.e 
Acidic Ribosomal protein Ll0.e 
Ribosomal protein 
Hypothetical protein 



500-1,000 arrays. Thus, a single primer pair provides enough 
starting material for up to -50,000 arrays. 

Primers were synthesized to amplify yeast ORFs. Primer 
synthesis bad a failure rate of <1% in over 18 plates of 
synthesis as determined by standard trityl analysis (6). The 
success rate of the PCR amplifications using the primer pairs 
was 94% based on agarose gel analysis of each PCR. The 
purifi d PCR products were used to generate arrays. Two 
versions of the arrays were created for the experimental results 
presented here. The first array contained 2^87 elements and 
the second array batch contained 2,479 elements. 

Genome Arrays. The amplified ORFs were arrayed onto glass 
at a spacing of 345 microns (Fig. 1). The high-density spacing of 
DNA samples allows the hybridization volumes to be mini- 
mizeo^-voJurnes are a maximum of 10 pi. The labeled probe can 
thus be maintained at relatively high concentrations, making 1-2 
jig of mRNA sufficient for analysis. This also obviates the need 
for a subsequent amplification step and thus avoids the risk of 
altering the relative ratios of different cDNA species in th 
sample. 

Genetic Analysis: Genomic Comparison of Unrelated Strains. 
Microarrays allow efficient comparison of the genomes of dif- 
ferent strains. Genomic DNA from Y55, an S. cennsae strain 
divergent from the reference strain S288c, was randomly labeled 
with Qtf-dUTP and hybridized simultaneously with the S2SSc 
DNA labeled with Cy5-dUTP. When a comparison between the 
hybridization of the DNA from the two strains was done, several 



elements gave relatively little or no signal above background from 
7c^ 3 ^ ch J mncJ (data not ^0^). These include SGE1. 

ENA2), and YCR105. These results imply that the regions 
containing these genes are extremely divergent, or all together 
deleted from the strain. Subsequent attempts to generate PCR 
products from SGEl, ENA2, and ASP3A using Y55 DNA failed. 
This result supports the conclusion that these genes are likely to 
be missing from the Y55 genome. It is interesting to note that at 
least two of the regions absent in the Y55 genome have been 
previously shown or suggested to be deleted in mutant laboratory 
strains (14-16). In particular, the Asp-3 region appears to be 
nighty prone to being deleted (15, 16). 

These results indicate that gene arrays can be used to efficiently 
screen different strains of an organism for large deletion poly- 
morphisms. A single hybridization and scan will reveal differences 
based on differential hybridization to particular elements, It is 
reasonable to suppose that an equivalent number of genes are 
present in the Y55 genome and absent in the S288c genome. This 
result sh uld be viewed as a minimum estimate of the deletion 
polymorphisms that exist between these two unrelated strains as 
intergenic deletions or small intragenic deletions would not be 
detected because considerable hybridizing material would be 
remain. Sequence polymorphisms, such as deleti ns, are present 
in populati ns f every species and must at some level affect 
phen type. One f the challenges f the genome era will be to 
critically examine sequence polymorphisms that exist in the 
natural gene pool relative to the reference genome sequence. 
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Gene Expression Analysis. The arrays were used to examine 
gene expression in yeast grown under a variety of different 
conditions. Expression analysis is an idea] application of these 
arrays because a single hybridization provides quantitative expres- 

Table 2. Cold shock vs. control expression data 



Fio. 2. ORF categories displaying dif- 
ferential expression between heat shocked 
and untreated cultures. Ban within cate- 
gories correspond to individual ORFs. 
Green shaded bars correspond to relative 
increases in ORF expression under 25*C 
growth conditions. Red shaded bars cor- 
respond io relative increases in ORF ex- 
pression under 39°C growth conditions. 



sion data for thousands of genes. To better understand results for 
genes of known function, ORFs were placed in biologically rele- 
vant categories on the basis of function (e.g., amino acid catabolic 
genes) and/or pathways (e.g, the histidine biosynthesis pathway) 



Ratio of 
gene expression 



Control 

Z4 
2.9 

2.0 
2.1 
12 
2.7 
3.1 
3J 
3.8 
7.9 
7.9 
16\5 
1.8 
2.1 
2.4 
2.8 
3.4 



Cold 



1.4 



23 

1.7 

1.7 

1.7 

1.6 

1.6 

3.1 

2J2 

2.0 

4.2 



ORF 

YOR153 
YCR012 
YCL040 
YHR064 
YJL034 
YDR258 
YLL039 
YLL026 
YER103 
YBR126 
YPL240 
YBR054 
YBR072 
YCR021 
YDR343 
YHR096 
YFR053 
YHR092 
YHR094 
YHR089 
YLR048 
YLR441 
YLL045 
YLR029 
YGL123 
YBR067 
YER011 
YCR058 
YKL102 



Gene 

PDR5 
PGK1 
GLK1 

KAR2 
HSP78 
UBI4 
HSP104 
SSA4 
TPS1 
HSP82 
YR02 
HSP26 
HSP30 
HXT6 
HXT5 
HXK1 
HXT4 
HXT1 
GAR1 
NAB IB 
RP10A 
RPL4B 
RPL13A 
SUP44 
TIP1 
TIR1 



Description 



Plciotropic drug resistance protein " " — 

Phosphoglycerate kinase 
Aldohexose specific glucokinase 
Heat shock protein 
Nuclear fusion protein 

Mitochondrial heat shock protein of clpb family of ATP-dependent proteases 
ubiquitin precursor 
Heat shock protein 
Heat shock protein 

a, a-Trchalosc-phosphatc synthase (UDP-forming) 
Heat shock protein 

Similarity to HSP30 heat shock protein Yrolp 

Heat shock protein 

Heat shock protein 

High-affinity hexose transporter 

Putative hexose transporter 

Hexokinasc I 

Moderate- to low-affinity glucose transporter 
Low-affinity hexose (glucose) transporter 
Nucleolar rRNA processing protein 
40S ribosomal protein p40 homolog b 
Ribosomat protein S3a.c 
Ribosomal protein L7a.e.B 
RibosomaJ protein L15.e 
Ribosomal protein 

Cold- and heat-shock-induced protein of the Srpl/Tiplp family 
Cold-shock-induced protein of the Tirlp, Tiplp family 
Hypothetical protein 
Hypothetical protein 
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Table 1 Glucose vs. galactose expression data 
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3.5 


YDL067 
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YHR051 
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YFR033 
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YLR081 
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YBR018 
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YBR020 
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YDR009 
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$A 




YKR080 
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6J0 




YDR019 
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YLR058 
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8.1 
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PH084 


3-5 




YDR408 


ADE8 


3.6 




YDR408 


ADE8 


4.4 




YAR015 


ADE1 


5.6 




YMR300 


ADE4 
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YOR128 


ADE2 


6.0 




YGL234 


ADE5.7 




63 


YBL015 
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Description 

Arginosucanate lyase 
Glutamatc-ammonia 

Aminotriazole and 4-mtroquinoline resistance protein 
Alcohol dehydrogenase II 
Alcohol dehydrogenase V 
ADP, ATP carrier protein 2 
ADP, ATP carrier protein 
^^-transporting ATP synthase 6 chain precursor 
H* -transporting ATP synthase y chain precursor 
Lactate dehydrogenase cytochrome 62 
Lactate dehydrogenase cytochrome b2 
Cytochrome -W reductase 

Ubiquinol-cytochrome c reductase 44K core protein 
Cytochrome c oxidase chain VII A 
Cytochrome c oxidase, sub unit VIB 
Cytochrome c oxidase subunit VI 
Cytochrome c oxidase chain VIII 
Ubiqurool-cytochrome c reductase 17K protein 
Galactose (and glucose) permease 
UDP-glucose-hexose-l -phosphate uridylyitransferase 
GaJactokinase 
UDP-glucose 4-epimerase 
Galaaose (and glucose) permease 
GaJactokinase 

Negative regulator for expression of galactose-induced genes 
Negative regulator for expression of galactose-induced genes 
ATP phosphoribosyj transferase 
Glutamine amidotransferase/cydase 

Phosphoribosyl-AMP cyclohydrolasc/phosphoribosyl-ATP pyrophosphatasc/histidino! 
dehydrogenase 

Methylenetetrahydrofolate dehydrogenase (NAD+) 
Glycine decarboxylase T subunit 
Serine hydroxymethyltransferase 
High-affinity inorganic phosphate/H* symporter 
Phosphoribosylglycinamide formyhransferase (GART) 
Phosphoribosylglycinamidc formyltransfcrasc (GART) 
Phosphoribosylamidoimidazole-succinocarboxamidc synthase 
Amidophosphoribosyltransferase 
Phosphoribosylaminoimidazoie carboxylase 

Phosphoribosylamine-glycine ligasc and phosphoribosylformylglydnamidine cydo-ligase 
Acetyl-CoA hydrolase 



Heat Shock Results. A log phase culture growing in YEP/ 
dextrose medium at 25°C was split in half. One half of the 
culture remained at 25°C whereas the other half of the culture 
was shifted to 39°C mRNA was isolated from both cultures 1 h 
after heat shock for comparison on microarrays and, although 
this time point is not optimal for measuring induction of heat 
shock mRNAs (17), many known heat shock genes exhibited 
c nsiderable induction at this time point (Table 1; Fig. 2). 
Down-regulation of genes in the ribosomal protein and histone 
gene categories was also observed. Differentia] expression 
between the heat-shocked culture and the control was also 

bserved for many other genes. Genes in many categories, such 
as amino acid catabolism and amino acid synthesis, exhibited 
a mixed response with some genes showing little or no 
differential expression and ther genes showing a significant 
increase or decrease in gene expression in response t heat 
shock (Table 1; Fig. 2). 

Cold Shock Results. A log phase culture growing in YEP/ 
dextrose medium at 37°C was split in half. One half f the 
culture remained at 37°C while the other half f the culture was 
shifted t 18°C mRNA was isolated from both cultures 1 h 
after c Id sh ck f r c mparison on microarrays. As expected, 



two known cold shock genes (TIP1, TIR1) were expressed at 
a significantly higher level in the cold-shocked culture. Genes 
in other functional categories, such as glucose metabolism and 
heat shock displayed a mixed response with expression of some 
genes being unaffected and other genes exhibiting significant 
up- or down-regulation in response to cold shock (Table 2). 

Steady-State Galactose vs. Glucose Results. mRNA was 
isolated from steady-state log phase YEP galactose and YEP 
glucose grown cultures for comparison on the microarrays. As 
expected, the GAL genes were expressed at a much higher 
level in the galactose culture. Many genes were differentially 
expressed in these cultures that were not a priori expected to 
exhibit differential expression. For example, some genes in the 
amino acid catabolic category were up-regulated in the galac- 
t se culture whereas genes in the one-carbon metabolism and 
purine categories were largely or entirely down-regulated in 
the galaaose culture (Table 3). Genes in other categories, such 
as amino acid synthesis, abc transporter, cytochr me c, and 
cytochrome o, exhibited mixed responses; some genes in a 
category showed little r no obvi us differential expressi n 
whereas ther genes in the same category showed significant 
differential expressi n in the galaaose and glue se cultures. 
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DISCUSSION 
The results of these experiments show that many genes are 
differentially expressed under the three environmental condi- 
tions described here. The expected and predicted changes in gene 
expression, such as HSP12 in the heat-shocked culture, TTP1 in 
the cold-shocked culture, and GAL2 in the steady-state galactose 
culture, were observed in every case. However, in addition to the 
ex p e ct ed changes in gene expression, significant differential 
expression was also observed for many other genes that would 
not, a priori, be expected to be differentially expressed. For 
example, expression of PHOll decreased and expression of 
YLR194, KIN2, and HXT6 increased in the heat shocked culture. 
Expression of MST1 and APE3 decreased and expression of 
PDR5 and GAR1 increased in the cold-shocked culture. In 
addition, ADE4 and SER2 were expressed at reduced levels 
whereas PH084 and ACH1 were expressed at higher levels in 
cells grown in galactose compared with cells grown in glucose. 
Differential expression of these and many other genes was specific 
to one of these three environmental conditions. 

Many ther genes were found to be differentially expressed 
under more than one condition. When differentially expressed 
genes in cold- and beat-shocked cultures were compared, 30 
genes were found in common. Of these 30 genes, 28 showed 
inverse expression (ix, increased expression under one condition 
a^ffra** 1 expression under the other condition). Two genes, 
YCR058 and YKL102, showed elevated expression in response to 
both cold and heat shock. Fifteen genes were found to be 
differentially expressed in both the heat-shocked and steady-state 
galactose cultures: 9 genes showed increased expression and 5 
showed decreased expression under both conditions. Twenty 
genes were differentially expressed in both the cold-shocked and 
steady-state galactose cultures: 8 genes showed decreased expres- 
sion and 5 genes showed increased expression under both con- 
ditions. Six genes showed increased expression in the galactose 
culture and decreased expression in the cold shocked culture 
One gene (ODP1) showed increased expression in both the 
cold-shocked and steady-state galactose cultures. 

Gene expression is affected in a global fashion when environ- 
mental conditions are changed and both expected and unex- 
pected genes are affected. There is also overlap in the genes that 
are differentially expressed under quite different environmental 
conditions. These results can be rationalized by considering the 
high degree of cross-pathway regulation in yeast. For example, 
there is evidence for cross-pathway regulation between (i) carbon 
and nitrogen metabolism (18), (a) phosphate and sulfate metab- 
olism (19), and (Hi) purine, phosphate, and amino acid metabo- 
lism (20-24). There are also examples of the interaction of 
general and specific transcription factors (25, 26). Finally, within 
the broad class of amino add biosynthetic genes, there is evidence 
f r ammo acid specific regulation of some genes, regulation via 
general control for other genes, and regulation via both specific 
and general control for other genes (22, 27-30). 

Cross-pathway regulation arises from the complex structure 
of pr moters. Virtually all promoters contain sites for multiple 
transenpti n factors and, therefore, virtually all genes are 
subject to c mbinatoriaJ regulation. For example, the HIS4 
promoter contains binding sites for GCN4 (the general amino 
acid control transcription factor), PH02/BAS2 (a transcrip- 
tional regulator of phosphatase and purine biosynthetic 
genes), and BAS1 (a transcriptional regulator of purine bio- 
synthetic genes) (31). It is likely that the complex effects on 
gene expression described in this work are a direct conse- 
quence f the combinatorial regulati n f gene expressi n. 

These findings illustrate the power f the highly parallel whole 
genome approach when examining gene expression. The global 
effects of environmental change on gene expression can now be 
directly visualized. It is dear that determining the mechanism(s) 
and the functional role of the dramatic global effects on gene 
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expression in different errviranrnents wiD be a significant chal- 
lenge. The era of whole genome analysis wiD, ultimately allow 
researchers to switch from the very focused single ^c^aiota 
view of gene expression and instead view the cell more as a large 
complex network of gene regulatory pathways, 

With the entire sequence of this model organism known, new 
!5Z?£> £ ^ developed that allow for genome wide 
analyses (32, 33) of gene function. The genome rnicroarrays 
represent a novel tool for genetic and expression analysis of the 
yeast gencroe This pilot study uses arrays containing >35% of 
the yeast ORFs and it is clear that the entire set of ORFs tern 
the yeast genome can be arrayed using the directed primer based 
strategy detailed here. Recent advances in arravinff technnlnov 
will allow all 6,100 ORFs to be arrayed in 3£2lS 
cm furthermore, as the technology improves, detection limits 
will allow less than 500 ng of starting mRNA material to be used 
for making probe. 

The genome arrays provide for a robust, fully automated 
approach toward examining genome structure and gene func- 
tion. They allow for comparisons between different genomes 
as well as a detailed study of gene expression at the global level 
Tins research will help to elucidate relationships between 
genes and allow the researcher to understand gene function by 
understanding expression patterns across the yeast genome. 
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